Query: Adds ability to choose global vs local/focused statistics for FullTextScore by neildsh · Pull Request #5582 · Azure/azure-cosmos-dotnet-v3

neildsh · 2026-01-30T00:43:10Z

Enabling users to choose global vs local/focused statistics for FullTextScore

Why?

Cosmos DB’s implementation of FullTextScore computes BM25 statistics (term frequency, inverse document frequency, and document length) across all documents in the container, including all physical and logical partitions.

While this provides a valid and comprehensive representation of statistics for the entire dataset, it introduces challenges for several common use cases.

In multi-tenant scenarios, it is often necessary to isolate queries to data belonging to a specific tenant, typically defined by the partition key or a component of a hierarchical partition key. This enables scoring to reflect statistics that are accurate for that tenant’s dataset, rather than for the entire container. For customers such as Veeam and Sitecore, which operate large multi-tenant containers, this is not just an optimization but a requirement. Their tenants often operate in very different domains, which can significantly change the distribution and importance of keywords and phrases. Using global statistics in these cases leads to distorted relevance rankings.

In other scenarios involving hundreds or thousands of physical partitions, computing statistics across the entire container can become both time-consuming and expensive. Customers may prefer to use statistics derived from only a subset of partitions to improve performance and reduce RU consumption. There is precedent for this: Azure AI Search defaults to this “local” method.

What?

We propose extending BM25 scoring in Cosmos DB so that developers can choose between a global FullTextScore (the existing behavior) and a scoped FullTextScore (statistics computed only over the partition key(s) used in the query). The key aspects:

For global BM25, FullTextScore retains its existing behavior and computes BM25 statistics, such as IDF and average document length, across all documents in the container regardless of any partition key filters in the query. In scoped BM25, when a query includes a partition key filter or explicitly requests scoped scoring, the engine computes these statistics only over the subset of documents within the specified partition key values. Query results are still returned only from the filtered partitions, and the resulting scores and ranking reflect relevance within that partition-specific slice of data.
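To make the difference concrete, here is a minimal sketch of how the same term can receive a very different IDF under global versus scoped statistics. It is an illustration only: the toy corpus, the helper names, and the Lucene-style IDF formula are assumptions, since this description does not state the exact BM25 variant Cosmos DB uses.

```python
import math
from collections import Counter

# Toy corpus of (partition key, tokens). All values are invented for illustration.
DOCS = [
    ("tenantA", ["kubernetes", "backup", "restore"]),
    ("tenantA", ["kubernetes", "cluster", "backup"]),
    ("tenantA", ["kubernetes", "snapshot"]),
    ("tenantB", ["marketing", "campaign", "content"]),
    ("tenantB", ["content", "personalization"]),
    ("tenantB", ["campaign", "analytics", "content"]),
]

def bm25_statistics(docs):
    """Return (document count, average document length, document frequency per term)."""
    n = len(docs)
    avgdl = sum(len(tokens) for _, tokens in docs) / n
    df = Counter(term for _, tokens in docs for term in set(tokens))
    return n, avgdl, df

def idf(term, n, df):
    """A common non-negative BM25 IDF variant (assumed, not taken from Cosmos DB)."""
    return math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))

# Global: statistics over every document in the container.
g_n, g_avgdl, g_df = bm25_statistics(DOCS)
# Scoped: statistics only over the partition key targeted by the query.
s_n, s_avgdl, s_df = bm25_statistics([d for d in DOCS if d[0] == "tenantA"])

global_idf = idf("kubernetes", g_n, g_df)  # ln(2), about 0.693
scoped_idf = idf("kubernetes", s_n, s_df)  # ln(8/7), about 0.134
```

Globally, "kubernetes" appears in half of the documents and looks moderately informative. Within tenantA it appears in every document, so its scoped IDF is close to zero and other terms dominate the ranking for that tenant.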

How?

The user issues a query like:

SELECT TOP 10 * FROM c
WHERE c.tenantId = @tenantId
ORDER BY RANK FullTextScore(c.text, "keywords")

The user also sets a new option on QueryRequestOptions called FullTextScoreScope, which takes one of two values: local or global. The request option is inspected, and the query uses scoped or global statistics accordingly.
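The selection logic described above can be sketched as follows. This is a conceptual model in Python, not the SDK's implementation: the enum mirrors the two values named in this description, and `documents_for_statistics` and `documents_for_results` are hypothetical helpers introduced only for illustration.

```python
from enum import Enum

class FullTextScoreScope(Enum):
    # Mirrors the two values named in the description; not the SDK's actual type.
    LOCAL = "local"
    GLOBAL = "global"

def documents_for_statistics(docs, partition_keys, scope):
    """Choose which documents feed the BM25 statistics for a query.

    docs: iterable of (partition_key, text) pairs.
    partition_keys: the set of partition key values the query targets.
    """
    if scope is FullTextScoreScope.GLOBAL:
        # Existing behavior: every document contributes, whatever the filter.
        return list(docs)
    # Scoped behavior: only documents in the targeted partitions contribute.
    return [d for d in docs if d[0] in partition_keys]

def documents_for_results(docs, partition_keys):
    """Results come only from the targeted partitions under either scope."""
    return [d for d in docs if d[0] in partition_keys]
```

The point of separating the two helpers is that the scope changes only which documents inform the statistics; the set of documents eligible to be returned is the same in both modes.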

Type of change

New feature (non-breaking change which adds functionality)

adityasa · 2026-01-30T01:04:52Z

Is it possible to add some emulator-based e2e tests?
One validation of interest is partition filtering based on:

query filter
QueryRequestOptions

In both cases, the query should honor the FullTextScoreScope (local vs. global).

neildsh requested review from FabianMeiswinkel, Pilchie, adityasa, khdang, kirankumarkolli, kirillg and sboshra as code owners January 30, 2026 00:43

neildsh added the QUERY and auto-merge (Enables automation to merge PRs) labels Jan 30, 2026

microsoft-github-policy-service Bot enabled auto-merge (squash) January 30, 2026 00:44

adityasa reviewed Jan 30, 2026

Comment thread docs/query/local_statistics_for_hybrid_search.md

adityasa reviewed Feb 3, 2026

Comment thread Microsoft.Azure.Cosmos/src/RequestOptions/QueryRequestOptions.cs Outdated

adityasa previously approved these changes Feb 3, 2026

sc978345 previously approved these changes Feb 4, 2026

sboshra reviewed Feb 4, 2026

Comment thread Microsoft.Azure.Cosmos/src/Resource/Settings/FullTextScoreScope.cs Outdated

sboshra previously approved these changes Feb 4, 2026

neildsh added 8 commits February 4, 2026 13:00

Initial change to allow user to switch between local and global statistics for full text search

9421a2f

Add test infrastructure for full text score local statistics

bf722a6

Add more tests for locally scoped FTS statistics

0fa8c96

Added test plan to design doc

54d6542

Update contracts, enable weighted rank fusion test, add emulator tests for FulTextScoreScore.Local

1679201

Add new emulator test cases for FullTextScoreScope.Local

d5e1f36

Move to query folder

2e68aef

address code review feedback

44ba1c6

neildsh dismissed stale reviews from sboshra, sc978345, and adityasa via 44ba1c6 February 4, 2026 21:00

neildsh force-pushed the users/ndeshpan/ftsLocalStatistics branch from 07a86a1 to 44ba1c6 Compare February 4, 2026 21:00

adityasa approved these changes Feb 4, 2026

This was referenced May 5, 2026

deps: Bump the azure-sdks group with 2 updates jppaquet/my.pretty.pipeline#30

Merged

chore(deps): Bump the minor-and-patch group with 4 updates rbrands/robert-brands-com#67

Closed

microsoft-github-policy-service[bot] merged 8 commits into master from users/ndeshpan/ftsLocalStatistics

5 participants
